O 
CD 
Q 
Ui 
M 
Lfi 
hD 

b 

! 5 i ! 
{ fa* 

! E ] 

«:V 
,:ts. 
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SEQUENCE LISTING 

<110> Wiener, Kryzysztof B. 
Hoist, Olof Peder 
Hachem, Maher Youssef Abou 
Karlsson, Eva Margareta Nordberg 
Hreggvidsson, Gudmundur 0. 

<120> Thermostable Cellulase 

<130> P5099PC00 

<150> PCT/IS01/00012 
<151> 2001-06-15 

<150> 09/594,884 
<151> 




<160> 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 1877 

<212> DNA 

<213> Rhodothermus marinus 
<220> 

<221> CDS 

<222> (710) . . (1495) 

<223> 

<400> 1 



gggcgagtcg 


aacgtcgggc 


gcgcctcgca 


gatcgaacgg 


ctcttcccac 


gctgattcgc 


60 


tgcgtgcgca 


cgcactgggc 


gctgggcgag 


tttcccttct 


acttcgtcca 


gattgcccct 


120 


tacgactacg 


ggccgaacgc 


ccgcacgcag 


cgtgtccggg 


aagcgcagtt 


gcgcaccatg 


180 


tgcgcgtgcc 


tcatacgggt 


atggtcgtga 


ccaccgatgt 


gggcgacgac 


cacaacacta 


240 


cctccggcgc 


gacaacgtga 


a 9t99C9 a 9 c 


gcctggcctc 


tgggcgctgg 


ccaacgacta 


300 


cggcttccgg 


gatgtggtct 


actccggtcc 


catctacgaa 


cgcatggaac 


gtgaggatgg 


360 


ccgcctgcgg 


ctctatttcc 


gctatgccga 


agggggactg 


gtcctgcgcc 


cggccgacgg 


420 


cggcggcgag 


ttcgtgattg 


cgggaccgga 


ccgtgtcttc 


cacccggcgc 


gggtacgggt 


480 


cgaaggcgaa 


acgctggtgg 


tgtggagtcc 


gcgcgtctcc 


gatccgcagg 


cggtgcgcta 


540 


tggctggtcc 


aatacgccgc 


acgccacgct 


gttcaaccgg gccggcctgc 


ccgcttcgcc 


600 


cttccgcacc 


gacgactggc 


cggaaggcga 


ctgagcgacg 


caaccggtgc 


ttgcatgcga 


660 


caggggcact 


tcgtaccttg 


aagtgccctt 


tgtcatttca 


atggaataa atg aac gtc 
Met Asn Val 


718 
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atg cgt gcg gta ctg gtc ctg age ctg tta ttg ctg ttt gga tgc gac 766 
Met Arg Ala Val Leu Val Leu Ser Leu Leu Leu Leu Phe Gly Cys Asp 
5 10 15 

tgg etc ttt ccc gat ggc gac aac gga aag gaa ccg gag cct gag ccc 814 
Trp Leu Phe Pro Asp Gly Asp Asn Gly Lys Glu Pro Glu Pro Glu Pro 
20 ~ 25 30 35 

gag ccg ace gtc gag ctg tgc gga cgc tgg gac gcg cgc gat gtg gee 862 
Glu Pro Thr Val Glu Leu Cys Gly Arg Trp Asp Ala Arg Asp Val Ala 
40 45 50 

999 999 cgc tac egg gtg ate aac aac gta tgg ggc gcg gag ace gec 910 
Gly Gly Arg Tyr Arg Val lie Asn Asn Val Trp Gly Ala Glu Thr Ala 
55 ~ 60 65 

cag tgc att gag gtc gga ctg gaa acg ggc aac ttc acg ate aca egg 95 8 

Gin Cys He Glu Val Gly Leu Glu Thr Gly Asn Phe Thr He Thr Arg 
70 75 80 

gec gat cac gac aac ggc aac aac gtg gec gee tat ccg gee ate tac 1006 
H' Ala Asp His Asp Asn Gly Asn Asn Val Ala Ala Tyr Pro Ala He Tyr 
O 85 90 95 

o 

H ttc ggg tgc cac tgg ggc gec tgc acg age aat teg gga ttg ccg egg 1054 

jTi Phe Gly Cys His Trp Gly Ala Cys Thr Ser Asn Ser Gly Leu Pro Arg 
100 ^ ~ 105 110 115 

IH cgc gtg cag gag ctg tec gac gtg cgc acg age tgg acg etc acg ccg 1102 
ijD Arg Val Gin Glu Leu Ser Asp Val Arg Thr Ser Trp Thr Leu Thr Pro 
,J 12 0 12 5 13 0 

U 

!:«. ate acg acg ggc cgc tgg aat gee gec tac gac ate tgg ttc agt ccc 1150 

;:r! He Thr Thr Gly Arg Trp Asn Ala Ala Tyr Asp He Trp Phe Ser Pro 

FU 135 ~ 140 145 



is i 



gtc acg aat tec ggc aac ggc tac age ggc ggc gee gag ctg atg ate 1198 
Val Thr Asn Ser Gly Asn Gly Tyr Ser Gly Gly Ala Glu Leu Met He 
150 155 160 

tgg ctg aac tgg aac ggc ggc gtg atg ccg ggc ggc age cgc gtg gee 1246 
Trp Leu Asn Trp Asn Gly Gly Val Met Pro Gly Gly Ser Arg Val Ala 
165 170 175 

ace gtg gaa ctg gee ggg gee ace tgg gaa gtc tgg tat gec gac tgg 12 94 

Thr Val Glu Leu Ala Gly Ala Thr Trp Glu Val Trp Tyr Ala Asp Trp 
180 185 190 195 

gac tgg aat tac ate gec tac egg cgc acg acg ccc ace acg teg gtg 1342 
Asp Trp Asn Tyr He Ala Tyr Arg Arg Thr Thr Pro Thr Thr Ser Val 
200 205 210 

age gag ctg gac ctg aag gee ttc ate gac gac gcg gtc gee cgc ggc 13 90 

Ser Glu Leu Asp Leu Lys Ala Phe He Asp Asp Ala Val Ala Arg Gly 
215 " 220 225 

tac ate egg ccg gag tgg tat ctg cat gcg gtg gag acg ggc ttc gaa 1438 
Tyr He Arg Pro Glu Trp Tyr Leu His Ala Val Glu Thr Gly Phe Glu 
230 235 240 



Li 



\ : ! 

o 

I. J. 
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etc tgg gag ggc ggg gec ggt ctg cga age gee gat ttt tec gta acg 1486 
Leu Trp Glu Gly Gly Ala Gly Leu Arg Ser Ala Asp Phe Ser Val Thr 
245 250 255 

gtg cag tag cctgtcacac gggcaccagc gtaggecaga gaagcacccg 1535 



Val Gin 
260 














teggggeget 


tatgegggeg 


gccgtcgctt 


tgtgcctgac 


tttgtagtgc 


getaeggagg 


1595 


cgtcagccgg 


cgtggtgcgt 


ttccctggag 


gcgcttcgtt 


tcgtgccgga 


cgagaactta 


1655 


tccgaaaagc 


aagggagaga 


cctggtaagc 


cagggctttg 


cactgcgcac 


cagaagcacg 


1715 


ggtttgtagt 


aagccacgaa 


ggcgtccgcc 


gaagtggcct 


cacccgcgac 


ggcacttcgc 


1775 


tgcgctccgt 


gcctgactac 


gagcgcgttg 


gccgtgcttc 


aaaeggcagg 


ctataaagct 


1835 


atacaggtcg 


aaatcgcgaa 


cttctccgac 


acttacaaac 


ac 




1877 



U: <210> 2 

Q <211> 261 

j!2J <212> PRT 

jiSJ <213> Rhodothermus marinus 

W <400> 2 

'Si 

Lfl Met Asn Val Met Arg Ala Val Leu Val Leu Ser Leu Leu Leu Leu Phe 
15 10 15 



! Gly Cys Asp Trp Leu Phe Pro Asp Gly Asp Asn Gly Lys Glu Pro Glu 
Q ' ~ 20 25 30 



Pro Glu Pro Glu Pro Thr Val Glu Leu Cys Gly Arg Trp Asp Ala Arg 
35 40 45 



Asp Val Ala Gly Gly Arg Tyr Arg Val He Asn Asn Val Trp Gly Ala 
50 55 ~ 60 



Glu Thr Ala Gin Cys He Glu Val Gly Leu Glu Thr Gly Asn Phe Thr 
65 70 75 80 



He Thr Arg Ala Asp His Asp Asn Gly Asn Asn Val Ala Ala Tyr Pro 
85 90 95 



Ala He Tyr Phe Gly Cys His Trp Gly Ala Cys Thr Ser Asn Ser Gly 
100 105 110 



Leu Pro Arg Arg Val Gin Glu Leu Ser Asp Val Arg Thr Ser Trp Thr 
115 " 120 125 
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Leu Thr Pro lie Thr Thr Gly Arg Trp Asn Ala Ala Tyr Asp lie Trp 
130 135 140 



Phe Ser Pro Val Thr Asn Ser Gly Asn Gly Tyr Ser Gly Gly Ala Glu 
145 150 155 160 



Leu Met lie Trp Leu Asn Trp Asn Gly Gly Val Met Pro Gly Gly Ser 
165 ~ 170 175 



Arg Val Ala Thr Val Glu Leu Ala Gly Ala Thr Trp Glu Val Trp Tyr 
180 185 190 



Ala Asp Trp Asp Trp Asn Tyr lie Ala Tyr Arg Arg Thr Thr Pro Thr 
195 200 205 



Thr Ser Val Ser Glu Leu Asp Leu Lys Ala Phe lie Asp Asp Ala Val 
210 215 220 



i :=: i Ala Arg Gly Tyr lie Arg Pro Glu Trp Tyr Leu His Ala Val Glu Thr 
;7 { 225 " " 230 235 240 



Gly Phe Glu Leu Trp Glu Gly Gly Ala Gly Leu Arg Ser Ala Asp Phe 



ht\ 245 250 255 



Ser Val Thr Val Gin 
260 



o 
m 

□ <210> 3 

12 <211> 786 

<212> DNA 

<213> Rhodothermus marinus 

<400> 3 



atgaacgtca 


tgcgtgcggt 


actggtcctg 


agcctgttat 


tgctgtttgg 


atgcgactgg 


60 


ctctttcccg 


atggcgacaa 


cggaaaggaa 


ccggagcctg 


agcccgagcc 


gaccgtcgag 


120 


ctgtgcggac 


gctgggacgc 


gcgcgatgtg 


gccggggggc 


gctaccgggt 


gatcaacaac 


180 


gtatggggcg 


cggagaccgc 


ccagtgcatt 


gaggtcggac 


tggaaacggg 


caacttcacg 


240 


atcacacggg 


ccgatcacga 


caacggcaac 


aacgtggccg 


cctatccggc 


catctacttc 


300 


gggtgccact 


ggggcgcctg 


cacgagcaat 


tcgggattgc 


cgcggcgcgt 


gcaggagctg 


360 


tccgacgtgc 


gcacgagctg 


gacgctcacg 


ccgatcacga 


cgggccgctg 


gaatgccgcc 


420 


tacgacatct 


ggttcagtcc 


cgtcacgaat 


tccggcaacg 


gctacagcgg 


cggcgccgag 


480 


ctgatgatct 


ggctgaactg 


gaacggcggc 


gtgatgccgg 


gcggcagccg 


cgtggccacc 


540 
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gtggaactgg ccggggccac 
gcctaccggc gcacgacgcc 
gacgacgcgg tcgcccgcgg 
ggcttcgaac tctgggaggg 
cagtag 



ctgggaagtc tggtatgccg 
caccacgtcg gtgagcgagc 
ctacatccgg ccggagtggt 
cggggccggt ctgcgaagcg 



actgggactg gaattacatc 
tggacctgaa ggccttcatc 
atctgcatgc ggtggagacg 
ccgatttttc cgtaacggtg 



<210> 4 

<211> 1134 

<212> DNA 

<213> Pyrococcus furiosus 
<220> 

<221> exon 

<222> (82) . . (1041) 

<223> 

<400> 4 

gaatgtcggg gcagattacg aagagagaag agctcttttt tataccttat attcaaatct 

aacaaatttg gaggatgtcc a atg age aag aaa aag ttc gtc ate gta tct 

Met Ser Lys Lys Lys Phe Val lie Val Ser 
15 10 

ate tta aca ate ctt tta gta cag gca ata tat ttt gta gaa aag tat 
lie Leu Thr lie Leu Leu Val Gin Ala lie Tyr Phe Val Glu Lys Tyr 
15 20 25 

cat acc tct gag gac aag tea act tea aat acc tea tct aca cca ccc 
His Thr Ser Glu Asp Lys Ser Thr Ser Asn Thr Ser Ser Thr Pro Pro 
30 35 40 

caa aca aca ctt tec act acc aag gtt etc aag att aga tac cct gat 
Gin Thr Thr Leu Ser Thr Thr Lys Val Leu Lys lie Arg Tyr Pro Asp 
45 50 55 

gac ggt gag tgg cca gga get cct att gat aag gat ggt gat ggg aac 
Asp Gly Glu Trp Pro Gly Ala Pro lie Asp Lys Asp Gly Asp Gly Asn 
60 ~ 65 70 

cca gaa ttc tac att gaa ata aac eta tgg aac att ctt aat get act 
Pro Glu Phe Tyr lie Glu lie Asn Leu Trp Asn lie Leu Asn Ala Thr 
75 80 85 90 

gga ttt get gag atg acg tac aat tta acc age ggc gtc ctt cac tac 
Gly Phe Ala Glu Met Thr Tyr Asn Leu Thr Ser Gly Val Leu His Tyr 
95 100 105 

gtc caa caa ctt gac aac att gtc ttg agg gat aga agt aat tgg gtg 
Val Gin Gin Leu Asp Asn lie Val Leu Arg Asp Arg Ser Asn Trp Val 
110 115 120 



cat gga tac ccc gaa ata ttc tat gga aac aag cca tgg aat gca aac 
His Gly Tyr Pro Glu lie Phe Tyr Gly Asn Lys Pro Trp Asn Ala Asn 
125 130 135 
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tac gca act gat ggc cca ata cca tta ccc agt aaa gtt tea aac eta 
Tyr Ala Thr Asp Gly Pro lie Pro Leu Pro Ser Lys Val Ser Asn Leu 
140 145 150 

aca gac ttc tat eta aca ate tec tat aaa ctt gag ccc aag aac gga 
Thr Asp Phe Tyr Leu Thr lie Ser Tyr Lys Leu Glu Pro Lys Asn Gly 
155 ^ 160 165 170 

ctg cca att aac ttc gca ata gaa tec tgg tta acg aga gaa get tgg 
Leu Pro lie Asn Phe Ala lie Glu Ser Trp Leu Thr Arg Glu Ala Trp 
175 180 185 

aga aca aca gga att aac age gat gag caa gaa gta atg ata tgg att 
Arg Thr Thr Gly lie Asn Ser Asp Glu Gin Glu Val Met lie Trp lie 
190 195 200 

tac tat gac gga tta caa ccg get ggc tec aaa gtt aag gag att gta 
Tyr Tyr Asp Gly Leu Gin Pro Ala Gly Ser Lys Val Lys Glu lie Val 
2 05 210 215 

gtc cca ata ata gtt aac gga aca cca gta aat get aca ttt gaa gta 
Val Pro lie lie Val Asn Gly Thr Pro Val Asn Ala Thr Phe Glu Val 
220 225 230 

tgg aag gca aac att ggt tgg gag tat gtt gca ttt aga ata aag ace 
Trp Lys Ala Asn lie Gly Trp Glu Tyr Val Ala Phe Arg lie Lys Thr 
235 " 240 245 250 

cca ate aaa gag gga aca gtg aca att cca tac gga gca ttt ata agt 
Pro lie Lys Glu Gly Thr Val Thr lie Pro Tyr Gly Ala Phe lie Ser 
255 260 265 

gtt gca gee aac att tea age tta cca aat tac aca gaa ctt tac tta 
Val Ala Ala Asn lie Ser Ser Leu Pro Asn Tyr Thr Glu Leu Tyr Leu 
270 275 280 

gag gac gtg gag att gga act gag ttt gga acg cca age act ace tec 
Glu Asp Val Glu lie Gly Thr Glu Phe Gly Thr Pro Ser Thr Thr Ser 
285 290 295 

gee cac eta gag tgg tgg ate aca aac ata aca eta act cct eta gat 
Ala His Leu Glu Trp Trp lie Thr Asn lie Thr Leu Thr Pro Leu Asp 
300 ^ 305 310 

aga cct ctt att tec taa attteggcaa cctgggaatt atcaagttta 

Arg Pro Leu lie Ser 

315 

agaaaaggtg gagttgctaa agaattcaaa gaaaatttga aaagtaactt ttattgtgat 
etc 



<210> 5 

<211> 319 

<212> PRT 

<213> Pyrococcus furiosus 
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<221> SIGNAL 

<222> (1) . . (19) 
<223> 

<220> 

<221> DOMAIN 

<222> (28) . . (49) 

<223> Linker Moiety 

<220> 

<221> DOMAIN 

<222> (50) . . (319) 

<223> Catalytic Domain 

<400> 5 

Met Ser Lys Lys Lys Phe Val lie Val Ser lie Leu Thr lie Leu Leu 
15 10 15 



Val Gin Ala lie Tyr Phe Val Glu Lys Tyr His Thr Ser Glu Asp Lys 
20 25 30 



Ser Thr Ser Asn Thr Ser Ser Thr Pro Pro Gin Thr Thr Leu Ser Thr 
35 40 45 



Thr Lys Val Leu Lys lie Arg Tyr Pro Asp Asp Gly Glu Trp Pro Gly 
50 55 * 60 



Ala Pro lie Asp Lys Asp Gly Asp Gly Asn Pro Glu Phe Tyr lie Glu 
65 70 75 80 



lie Asn Leu Trp Asn lie Leu Asn Ala Thr Gly Phe Ala Glu Met Thr 
85 90 95 



Tyr Asn Leu Thr Ser Gly Val Leu His Tyr Val Gin Gin Leu Asp Asn 
100 105 110 



lie Val Leu Arg Asp Arg Ser Asn Trp Val His Gly Tyr Pro Glu lie 
115 120 125 



Phe Tyr Gly Asn Lys Pro Trp Asn Ala Asn Tyr Ala Thr Asp Gly Pro 
130 135 140 



lie Pro Leu Pro Ser Lys Val Ser Asn Leu Thr Asp Phe Tyr Leu Thr 
145 150 155 160 



lie Ser Tyr Lys Leu Glu Pro Lys Asn Gly Leu Pro lie Asn Phe Ala 
165 170 175 
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lie Glu Ser Trp Leu Thr Arg Glu Ala Trp Arg Thr Thr Gly lie Asn 
180 185 190 



Ser Asp Glu Gin Glu Val Met lie Trp lie Tyr Tyr Asp Gly Leu Gin 
195 200 205 



Pro Ala Gly Ser Lys Val Lys Glu He Val Val Pro He He Val Asn 
210 215 220 



Gly Thr Pro Val Asn Ala Thr Phe Glu Val Trp Lys Ala Asn He Gly 
225 230 235 240 



Trp Glu Tyr Val Ala Phe Arg He Lys Thr Pro He Lys Glu Gly Thr 
245 250 255 



Val Thr He Pro Tyr Gly Ala Phe He Ser Val Ala Ala Asn He Ser 
260 265 270 



Ser Leu Pro Asn Tyr Thr Glu Leu Tyr Leu Glu Asp Val Glu He Gly 
275 280 285 



Thr Glu Phe Gly Thr Pro Ser Thr Thr Ser Ala His Leu Glu Trp Trp 
290 295 300 



He Thr Asn He Thr Leu Thr Pro Leu Asp Arg Pro Leu He Ser 

315 



305 


310 


<210> 


6 


<211> 


12 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


HSV-Tag 


<400> 


6 



Ser Gin Pro Glu Leu Ala Pro Glu Asp Pro Glu Asp 
15 10 



<210> 


7 


<211> 


29 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic Oligonucleotide 


<400> 


7 



aggactccat atgaccgtcg agctgtcgg 



<210> 8 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<400> 8 

acctgagaag cttctgcacc gttacgga 



<210> 9 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<400> 9 

agctccatat gtgcgactgg ctctttcc 



<210> 10 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<400> 10 

acctgagaag cttctgcacc gttacgga 



